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Preface 



America's public education systems are in the process of coping with a dilemma created by 
the current educational reform movement On one hand, many politicians, educators, and 
citizens agree that schemes such as merit pay and master teacher plans should be implemented to 
reward outstanding teachers. Twenty-six- states are presently involved with plans to reward 
outstanding performance (Cornett and Weeks, 1985). On the other hand, professional educators 
have historically been unable to agree on a satisfactory definition of outstanding teaching. 
Teaching has been characterized as so complex and: context-dependent that it may; not be 
reducible to a definition which is widely accepted within the teaching rank* I 

Although the public now appears willing to provide higher salaries through merit pay, career 
ladders, and master teacher plans, educators may not have the ability to respond/adequately. 
Teacher acceptance of these plans appears crucial to their success, but it may be impossible to 
articulate a definition of outstanding teacher which would make such plans work* It is incumbent 
upon the education profession to demonstrate that it can develop practical definitions which will 
be acceptable to those affected. The challenge will be to devise criteria of outstanding teething 
which go beyond the need to satisfy the popular outcry and serve to improve the teaching 
profession by helping these plans succeed. 

Whether the profession can respond adequately to this challenge may be demonstrated in 
twenty Maine school systems which are pilot testing the state's new certification law. At the time of 
this writing, sixteen of these systems have devised lists of criteria for identifying outstanding 
teachers (described as Master Teachers in the Maine law). After reviewing past efforts to define 
outstanding teaching, this study examines the content and nature of Maine's locally-devised 
definitions from three perspectives: (a) the criteria which have been proposed by local committees, 
(b) the acceptability of such criteria to a broad cross-section of Maine teachers, and (c) the 
comparability of what has been proposed to what appears acceptable. 

In order to facilitate comparison and analysis of the proposed lists, a framework is developed 
for categorizing and classifying the content and nature of criteria. This framework is derived from 
the categorization scheme utilized in Tennessee's Career Ladder Program and from the type of 
variables commonly used in research on teaching. It serves as the basis for describing the sixteen 
locally developed lists and the criteria which comprise them. 

The design and results of a survey which was used to assess Maine teachers' perceptions of 
suggested criteria are also described. The analysis of this survey provides information about the 
content and nature of criteria which are most liKe*y to be accepted or rejected by a broad 
cross-section of Maine teachers. 

The final analysis explores the congruence between the criteria preferences of local committees 
(with extensive teacher involvement) and the criteria preferences of Maine teachers. Based on this 
analysis, recommendations are made which should help the designers of future criteria lists. 

The author conducted a comprehensive study of the criteria which sixteen Maine school 
systems have proposed for selecting master teachers. This paper summarizes the study and 
addresses the design of outstanding teacher selection programs in light of the study's results. It is 
divided into four sections: (a) a summary of the problem, methodology, and results; (b) conclusions 
drawn from the results; (c) limitations of the study; and (d) implications of the study. 
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Summary 

Many of the state level educational reforms which fiave swept the country have included a 
component aimed at selecting outstanding teachers for special recognition, status, or pay. Educators 
have agreed that teacher acceptance of such plans is cmriatto their success and that the selection 
criteria utilized may affect their lev»l of acceptance, ftbwever, there has beenJiMe agreement on 
what constitutes a definition of outstanding teacher and little lias been learned about what criteria 
are suitable for selecting one. In order to make recognition plah$ successful, we quist learn more 
about outstanding teacher criteria and how teachers perceive tltem. v | 

The study was aimed at learning more about such criteria by. fpcusing on those proposed in 
sixteen Maine sites. Three research questions addressed various aspects of Maine's locally developed 
criteria. The first question was: , \ 

What content and characteristics describe the master teacher criteria lists utilized in the Maine 
pilot sites? 

A detailed examination of the criteria adopted in sixteen Maine sites was conducted. Criteria and, 
subsequendy, criteria lists were compared on: a) the basis of a conceptual framework derived from 
the content categories used in the Tennessee Career Ladder Program, h) the typfres of criteria used 
in research on teaching, and c) criteria inference level. ; * 

The categorization scheme used by Tennessee in its Career Ladder Program was deemed 
appropriate for describing criteria v .itent The Tennessee criteria list is familiar to many practitioners 
since it has been widely publicized, distributed, and discussed* It is typical of schemes which have 
been used to group teacher competencies and behavior for various purposes (Vincent, 1986). 

The Tennessee list is divided into six categories: planning for instruction, delivery of 
instruction, evaluation of student progress, classroom management, professional leadership, and 
basic communication skills. Most Maine lists contained criteria from alPpf these categories, but 
with varying emphasis. Items like those from Tennessee's Professional Leadership category (28%)^ 
and Classroom Management (16%) were also utilized extensively. The remaining three categories 
[Evaluation of Student Progress (8%), Planning for Instruction (8%), and Basic Communication 
Skills (7%)] were represented but accounted for a smaller portion of the lists. 

Several types of criteria can be used to address the same content The four types used in the 
study are based on descriptions of variables found in the literature relating to research on 
teaching. The process and outcome types are closely related to the process-product paradigm. 
They are widely used to describe the independent and dependent variables in studies pf teacher 
effectiveness (e.g., ERS 1983; Medley 1985; Ryans 1960). The input and context types are based on 
the presage and context variables described by Dunkin and Biddle (1974) and Gage (1978). The 
following example of criteria show how the same content can be expressed in terms of the four 
different types. 

Input criteria deal with predetermined teacher characteristics or qualities, e.g. The teacher 
knows how to write lesson plans. 

Process criteria describe what a teacher does. e.g. The teacher writes lesson plans. 

Outcome criteria deal with outcomes of a teacher's work. e.g. As a result of lesson planning, 
classes are well organized. 



Context criteria always tie the variable to some condition such as grade level, subject area, age 
of students, or community expectations, e.g. The teacher writes lesson plans which are appropriate 
for the grade being taught 1 

The Maine lists were primarily composed of process criteria (89%). Input criteria were 
sparsely used (8%) while context specific criteria ( 1%) and outcome criteria (0.4%) were practically 
non-existent 

In addition to differences in content and type, criteria differ in specificity or inference level. 
High inference items are abstractly stated in broad or general terms. They cannot be rated without 
making a judgment Low inference items refer to specific qualities, characteristics, behavtors, or 
outcomes. They are clearly observable or measurable. 

Maine criteria were usually worded in high inference or abstract terms (85%). Few of them 
could be applied without using a great deal of rater judgment 

The comparison of the sixteen Maine lists was relatively simple, since all but two were 
predominantly composed of criteria of the same type (process) and inference level (high). A 
comparison on the basis of content, revealed four distinct list models. 

The lists which fell into each model were remarkably similar to each other The Tennessee 
model (4 lists) closely paralleled the original Tennessee List The short comprehensive model (7 
lists) contained fcv;er items than the Tennessee model but was comprised of criteria which were 
similar to Tennessee's. The professional leadership model (4 lists) was dominated by items from 
the Professional Leadership category. The classroom instruction model (1 list) contained primarily 
Delivery of Instruction and Planning for Instruction items. 

The second research question was: 

What criteria content and characteristics ivill a random sample of Maine Teachers accept or 
reject for inclusion in a definition of outstanding teacher? 

This question was probed through the design and administation of a questionnaire. One hundred 
eighty-seven Maine teachers rated various content and criteria for acceptability and measurability. 
In general, these respondents indicated that most content topics were both acceptable and 
measurable. Nevertheless, there were differences in levels of acceptability and perceived measurability. 

Content from the Delivery of Instruction, Classroom Management, and Evaluation of Student 
Progress categories was most acceptable to .the respondents. In addition, they showed a preference 
for items which addressed student instructional needs, process and context type criteria, and high 
inference items. The least acceptable content was from the Professional Leadership, Planning for 
Instruction, and Basic Communication Skills Categories. Input type criteria and low inference 
criteria were also rated low in acceptability. 

Perceived measurability was highest for low inference items, outcome type criteria, and 
Planning for Instruction content Some content which was not included in the Tennessee list (e.g., 
years of experience, certification level, knowledge of subject) was also rated high in measurability. 
Perceived measurability was lowest for high inference items, input type criteria, and Basic 
Communication Skills content 

The third research question was: 

lb what extent will the content and characteristics which teachers approve be related to the 
content and characteristics of criteria utilized by the Maine pilot sites? 
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This question was examined in a comparison of c^t«iia'.|At|l^ed,iin' tticl^j^l^'^^^^i!^^;. 

survey respondents found to be most and least acceptable' and me^urable. Bbth con^itieiandl _ _J§| 

incongruities. were uncovered. ' , , ^ \\y -^|^\ ^ :;< ^ : 
When utilization level was compared to acceptability levels, agreement ^^^<i|ent^f 



Planning for Instruction and Basic CommuniratibnS^ ^i^^^r^^^^^l^ 
and the respondents. Disagreement was evident fof ltofession cqnten£3i^s the 

most frequently utilized but least acceptable category. .-*/tf " ■ > s ^ ; 

The sites and the respondents agreed on tHree criteria types, THey both -p^qf^'^pad 
criteria but did not favor input and outcome types. They disagreed on context criterra, which the 
respondents rated high. 

Agreement was also evident for inference level. High inference items were extensively 
utilized and were more acceptable than low inference items. 

Fewer congruities were evident when the perception of me^ur?bUity was consid ere<d;! 
was only one content category and one criteria type for which lohl utiliration, te^ 
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and perceived measurability levels matched. Basic Communication Skills coiiteht ^.^.^-^ 
criteria received low ratings on each. These were.also the only two chaiacfcnstics for jyhi^jfie^ 7 
was a match between acceptability and measurability. The chsunacteristics which were rated fiigaii 
measurability were rated low in both utilization and acceptability. 

Conclusions 

The preceding results led to conclusions regarding: teachers' criteria preferences, the umesoKftfi 
problem of measurability, and the effect of allowing criteria lists to be developed locally. 

Teacher Preferences 

The survey discovered some useful generalizations about teacher perceptions of mteria* 
from different content categories and different criteria types. Threg generalizations about teacher 
preferences are pertinent to those who wish to have their lists accepted on a b^ad basis. ^ 

First, the respondents agreed that most topics from the survey should be incited in an 
outstanding teacher definition. Thirty-one of the forty-two topics were acceptable to seveqty-|ive 
percent or more of the respondent?. All but four items were acceptable to a majority of them. 
Teachere appeared to believe that an outstanding teacher vvoufd be characterized by many 
different qualities. No single quality or group of qualities clearly stood out as most important This 
generalization was supported in the comments made by ten respondents who noted ijnujthe 
questionnaire margins that all categories were equally important 

Acceptability responses were also evenly distributed across content categories. All categories 
were rated in the acceptable range, although Professional Leadership placed last in every analysis 
of acceptability This led to the inference that teachers would favor a list which placed e^jal 
emphasis on all content, except Professional Leadership and some particular items from the 
"Other" category (certification level and years of teaching experience). 

Second, an important generalization emerged when individual criteria were examined 
without regard to their category. The most highly favored items related to subject matter, students, 
or classrooms. Respondents gave the lowest acceptability ratings to criteria that related to peripheral 
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issues Such as (a) extra cuiriralar activities, (btycH* of teaching experience, (c) cert ficadon status, a J \ldf 
(d) physical arrangement of the d^room, (e) use of ^tij^^^^[e^| 
respoiidents* view, outstanding teachers should be judged by what SieyJ^i^^ , 
children. They should not be judged ohvthe Iwim^ 
Leadership category, which deal with activity outside the d 

This generalization was also suppoiwjn the^ 
In general, the respondents preferred process criteria. H^e^er, 
the context referred to the smdents' varying needb*/^ 
needs of students were the most acceptable to fhe;SssK^ 
receive careful consideration in the develqpj^ 

Third, when respondents compared different criteria types, outrome item^ 
Items which related to student tests, assessment; and evaluation ranked last orn^t^^^lini 
acceptability. This presented a paradox: teachers pwfei^ cnte 
instructional needs, yet they did not favor criteria which repr^nt^^ 
well those needs are met The contradiction is similar to those discussed ^u^^e^: m|^&f ll, ^ 



Measurability 

Three sections of the survey indicated that there were inconsistencies between the kinds of 
criteria which teachers preferred and those which they perceived as easiest to measure, i^ialyscjs 
of criteria content, type, and inference level pointed to seventt^ j. 

Content The three content categories >vhich were ratedh^ 
the mid-range of measurability. However, many of the,ind^ . 
were among those rated as most difficult to measure. Some ^pqndeh^( j5) em^ 
point They commented that, in general, the qualities which were mostimportant to being an 
outstanding teacher were either difficult or impossible to measure (e.g., motivation,.^ 
intuitiveness). \ ^ V;^ \v 

Conversely, one of the categories (Planning for Instruction) which was rated ^as easiest to 
measure was among the least acceptable. This was also true for wveral partiailar ite 
certification level and years of experience. They were rated high in measurability but near |Ke 
bottom in acceptability. Thus selection of criteria k compounded by the feet that acceptable 
criteria were often seen as difficult to measure, and measurable criteria were often unacceptable. 

Type. On one hand, several process items were rated most acceptable and easiest to measure 
(among four types being compared} by the respondents). On the other hand, several process items 
which were rated high in acceptability were rated next to lowest in measurability* Conversely, four 
outcome items which were rated first or second in measurability were rated last or next $ last in 
acceptability. In the section of the survey which examined criteria type, some of the most 
acceptable items were perceived as least measurable; and some of the least acceptable were seen as 
most measurable. 

Inference Level When the respondents compared high and low inference items, the most 
acceptable were those which they perceived as least measurable. Teachers preferred high inference 
items, but perceived low inference items as easier to measure. Peterson and Peterson (1984) 
pointed out that evaluative criteria have been criticized for being high inference items which were 






not measurable. The results of this f tudy indicated that the same crititism ts H^ 
outstanding teacher criteria^The sixteen Maine Ibts were dominated by high inference- items, ^ s ^ v jM 
-which Maine teachers thoughrwcire least me^urabie; \ V'^ r ^z^$^^ *^4>~ ^^PM'-K- ^'5S 

In summary, it would be difficult to predict what teachers would perceive as bo^ 
and measurable. Although the study found an overall positive relationship betweert m^ 
and acceptability of content and cntena type,_there was one content ategory and 
which acceptability and measurability were* ranked at opp^ite ex 
relationship was clear; acceptability and measurabi^ 

Teachers did not prefer the specific, measurable, low inference items. Teachers appeared to agree 
that certain kinds of criteria were easiest to measure, but they weiie 'not riecesa^ly willingjto 
them. * , '/^ 

These results generate many questions about the feasibility of developing use^ 
basis of teacher preferences. Criteria used in the past were criticized for being, unspe^ific and 
unmeasurable, but these same criteria are the kind which teachers in the survey appear to :{>refc|V ^ , ^0 
Selection program developers who wish to use acceptable criteria will be faced with problems Jjjf 
measurement Somehow, they will need to convince teachers that specific criteria and outeome 
criteria can be applied fairly. , f 

Local Development 

The state's decision to rely on local groups to devise master teacher criteria appears to have 
been based on two assumptions: first, that local list development would lead to a variety of lists and 
approaches; and second, that teachers would know what was acceptable to o&ei: te^hejS. 
Involving local teachers in the selection of criteria might help ensure some variability and some 
degree of acceptability to teachers at large. ^* ^ - : 

The first assumption is not supported by the examination of such lists. All of the lists except 
one utilized criteria which were similar to those used in typic^evaluaftonfor^ 
same categories which Wise, Darling-Hammond, McLaughlin, and Berstein (19J§5) ' wd"i^i| 
( 1982) found in their reviews of teacher evaluation p^^dures.TTielocaJ lists were similar in their 
uneven emphasis of the categories utilized. Professional Leadership items dominated most of the 
lists. In addition, there was little variation in the way criteria were written. Eighty percent of all 
items were of the process type, and eighty-one percent were high inference criteria. 

Relying on local groups to devise master teacher criteria lists diet not lead to a great deal of 
variability from site to site. On the basis of this study, fifteen of the sixteen Maine lists appeared to 
be more similar than they were different This vtas particularly evident in the common reliance on 
high inference, process type, Professional Leadership criteria. 

The second assumption was supported by many authors, but it was not entirely supported by 
this study. Weeks (1985), ^chlechty, Jbslin, Leak, arid Hanes (1985), Riddle <|^5^axfd^||^s 
( 1984), for example, all encounged teacher involvement in criteria development Th& swdy did 
not refute claims that teacher involvement in a local process would increase the shared undent 
which Darling-Hammond, Wise, and Pease (1983) feJ*; was necessary, feacner participation in , 
criteria selection might increase acceptability for those who were directly involved, but it might 
not have an effect ori teachers who were not 
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The Maine lists were developed with extensive teacher involvement TSachere participated in 
the development of all sixteen lists analyzed in this study. For most of the li^ie^m^m^is^d 
at least a majority of the criteria selection group* However, the resulting criteria ^ 
emphasize the content which was most accepfble to the teachere surieyed. ^ 
content was acceptable to the respondents, the content which they favored laast received, the 
greatest emphasis in the local lists. * 

Limitations 

The findings and conclusions which are discussed above provide considerable infpimation 
for criteria list developers. Before examining the implications of these results, it is nec^sary to 
inform the reader of several limitations. First, the criteria lists were treated asjA6u^mey,^|e an 
exhaustive representation of the site's master teacher concept However^use of ^letfcm, |^otui^ 
which included (a) interviews, (b) recommendations from other professionals, (c) pej^j^^^h^ 
and (d) classroom observations indicated that unlisted factors may have been, w^[k>l^f j^Ttof 
the local definitions. These factors were not assessed in the survey. The reader must be cautioned 
about assuming that the lists represent all aspects of the teacher selection process. . 

Second, while the sample appeared to represent the demographic characteristics o| the 
population of Maine teachers, the response rate was 37%. Althoughime can only speculate about 
the non-respondents, it is clear that the respondent group does not include teachers who were not 
motivated by the topic and the survey. Whether this depressed or inflated acceptability scores 
remains open to conjecture. It is reasonable to assume that responses would have varied more 
widely if the opinions of unwilling respondents had been assessed. 

Third, the reliability of those parts of the survey which examined criteria items tnay have 
been affected by the wording of die items. When a criteria characteristic was being assessed, every 
attempt was made to keep the other two characteristics constant; but, this may not have been 
completely successful. 

Finally, the survey results were more homogeneous than expected. This led to the important 
findings that teachers were willing to accept a broad variety of content and viewed most content as 
measurable. On the other hand, this homogeneity made it difficult to distinguish between levels 
of acceptability and measurability. 

Implications 

The major conclusions" discussed above imply policy or procedural considerations for 
practitioners. They provide a basis for farther work in three areas: (a) developing an acceptable 
outstanding teacher criteria list; (b) addressing the problems associated with measurability; and (c) 
involving local educators in list development 

List Acceptability 

It appears from the survey results that teachers would favor a broad, comprehensive list which 
covers all aspects of teaching. Nearly all of the content presented in the questionnaire was judged 
acceptable. Teachers appeared to feel that an outstanding teacher was outstanding in all areas 
surveyed. If more areas had been surveyed, inose too might have been judged important 



9 

u 



In their comments, some respondents suggested that the questionnaire a id not, and could 
not, cover all aspects of outstanding teaching. Sample comments included: 
"There are too many variables to be listed T _ 
tt I don't sense a recognition here of the complexity of the issue* 

This questionnaire is for removed from the day to day occurrences and experiences of 
teachers in the classroom ." 

For some respondents, the whole list seemed off target^ They saw outstanding teaching as moje 
intangible than suggested by the questionnaire items. For example: 

"Teachers have been endowed with special caring and talents from above* 

"How do you critique intuitiveness, instincts, arid old fashioned gut feeling which we 
subconsciously employ in the classroom?*' 
. . a lot of magic goes unrecognized." 

In 1925, Anna Riddle indicated that teachers thought it was impossible to measure the "whole 
value" of a teacher's work* The survey results imply that many teachers still think that ho list can 
represent all of the characteristics of an outstanding teacher Teachers with this view are bound to 
be displeased with any list proposed. v H 

Furthermore, the homogeneity of the respondents' acceptability ratings indicates that it 
would be a mistake to use any list which emphasized one topic over others. It is dear rtiat teachers 
in the sample did not want to be rated on Professional Leadership criteria or otHer items which 
related to what they did outside of their classrooms or in the absence of students. However, except 
for a few unpopular items, all content appears to be almost equally important to teachers! 

In summary, the survey results indicate that in order to be/ accepted by; teachers, an 
outstanding teacher criteria list would need to (a) cover a broad range of topics, (b)- place equal 
emphasis on most topics, (c) avoid items like those from the Professional 'Leadership category, and 
(d) whenever possible include items which relate directly to students. 

Measurability 

The respondents ranked few topics high in both acceptability and measurability. Teachers 
often saw the most important items as difficult to measure. If list developers selected outstanding 
teacher criteria on the basis of what teachers would accept, they would end up with many 
unmeasurable items. Conversely, if they chose measurable criteria, they would have many 
unacceptable items, 

This paradox will not be easily resolved. List developers will need to work closely with 
teachers if they wish to arrive at a list which is both acceptable and measurable. This study did 
point out a few items which were acceptable and measurable to most teachers. Listdevelopers 
could begin with these particular items in order to earn the trust of participating teachers. 

The application of criteria has been a major problem in past recognition plans that foiled. 
1 wis study may help implementors select acceptable criteria, but this would only be the firat step. 
They would also need to develop evaluation procedures which were more acceptable than those 
used in the past In order to make these procedures work, it appears that low inference;and 
outcome criteria should be used. Low inference criteria have been .recognized by evaluation 
critics and by the teachers in this study as the most measurable kind. Unless they are utilized, 
application of criteria will be subject to rater bias and cries of unfairness will undoubtedly be 
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heard. Outcome criteria are needed in order to satisfy the calk for kcopdk^ 

and reformers, as well, often focus on student outcome criteria such as '^j^denf KM^^p; 

The problem with outcome and low inference criteria, hweyef, if ffi^ 
favored by teachers. Outcome items which would identify ou&tpdiife 
results and demonstrated student achievement were sunpng^ 
inference items were consistently rated low in acceptal^ 
dislike outcome and low inference criteria because they !^ 
their work. The literature suggests otherwise; teachercp^ 
did not feel that they would be interpreted feirly. If these ^ 

implementors must convince teachers that criteria will be appii^impaitially -^^'6l^^^yf' m 

Another reason to utilise low inference items is the need to make^lectionfg^^ 
defensible. If the selection plan involves monetary rewards or promotion 
administrative appeals can be expected from teachers who feel they are unjustly demi^^^^ju 
To defend their selection procedures, those responsible will have to show A^^e^C^fM^ 
objective and measurable. Unless the criteria are low inference, this will be diffi^]j^^pr ^ 

Given the many potential conflicts between acceptability ind ineasurab^ 
uncovered, the profession may not be prepared to deal with the reco^itiorrof certain teachers as 
more outstanding than others. Before recognition plans can be acceptable to teachers, it appears 
that the paradox of acceptability and measurability needs to be resolved. This study indicates that 
teachers would accept a definition based on what they do in their classrooms with children. 
However, they are not yet convinced that these processes can be measured objectively. 

Local List Development 

One respondent commented that a a just and fair" list could not be devised because "situations 
and conditions are so variable across the state" Anticipation of such criticism was pliably the 
main impetus for allowing local list development during the pilo' period. Allowing local pilot sites 
to develop their own criteria lists was expected to result in lists which varied according to local 
needs. However, the sixteen lists were not very different from each other or from lists produced in 
other places for teacher evaluation purposes. 

In several sites, this similarity was the obvious result of heavy reliance on the Tennessee 
model. In other sites, it may have reflected the difficulty of defining outstan(jing teaching without 
relying on some alreadv familiar base. The language of typical teacher evallfeton forms or of 
effective teaching literature may have been the only common base for developing master teacher 
criteria lists. 

Another factor contributing to this uniformity may have been the Master Teacher Certification 
pilot process. Each pil jt site was charged with the establishment of teacher support teams which 
would be partially staffed by master teachers. The master teachers on these teams would act as 
mentors for other teachers whn were being supported for recertification purposes. The hqavy 
reliance on Professional Leadership and Delivery of Instruction criteria may have occuireri in 
anticipation of the mentoring role which master teachers would be expected to play. These kinds 
of criteria may have been chosen because they reflected the s>< *lls needed to help other teachers in 
their classrooms. 
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This presents an interesting dilemma. If master teachers are selected ^ to fi%friew rol^inithte 
schools* the criteria used to select them should be related to that role, fo^ 
the same criteria which would identify the persons 

teachers. The most outstanding teacher may not necessarily be the one who possesses the skills 
needed to work with other teachers as a mentor, evaluator, staff developed or curriculum leader. 
Criteria for recognizing outstanding teachers may need t> be different from cntena for selecting 
teachers to fill new roles such as those proposed under earner ladder plans - . \ 



Extensive teacher participation in the selection of MaineVOTteria was expected to 



their acceptability to other teacheis. However, it did not result in the seledioh\o%the^most 
acceptable list content identified by die survey. As pointed out previously, the sixteen s^ 
emphasized professional leadership criteria which were least acceptable "to a broad cross^section 
of Maine teachers. - - * * - " ""W" . \- 

The ultimate contribution of local; list development may not have, $*e^;<^^ 
variability It may have simply been the involvement of teachers. It may hsiye been ..a^y^^w^^^^ 
teachers a feeling of control over their own profession, time ami effort could have bwn sav^ if a; 
statewide list of acceptable criteria had been developed and disseminated, but ihis approach 
would have reduced opportunities for participation at the local level. 



Summary of Implications 

It will be difficult to devise a list of criteria which completely defines an outstanding teacher. 
Teachers would expect such a definition to be extensive and possibly exhaustive. They see many 
criteria as equally important^ ? ^ > " J • V 

In addition, many of the qualities required of an outstanding teacher are very Tow in 
perceived measurability. In particular, when an important and acceptable factor is reduced to a 
low inference measurable criterion, it is likely to become unacceptable; - f \; ^ v - 

Local list development will not necessarily lead to variability among lists, nor will it necessarily 
lead to criteria which are acceptable to a broad cross-section of teachers. It may, howcyCT^help 
garnet teacher support by giving them a role in their own recognition program. r - ^ 

Fortunately, some of the criteria content which were needed and adopted by Maine's pilot 
sites appeared to be both acceptable and measurable. These topics may provide a good starting 
point for future efforts. Nevertheless, this study indicated that problems of measurability will need 
continuing attention. Extensive collaboration between teachers and program developers will be 
needed if criteria lists are to improve and if application procedures are to be perceived as fair 
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